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1 Introduction 

We recall that a (degree-d) polynomial threshold function (or PTF) is a function of the form 
f{x) = sgn(p{x)) for some fixed (degree-d) polynomial p. Polynomial threshold functions have 
found application in many areas of computer science, but many fundamental questions about 
them remain open. Perhaps one of the longest standing of these problems is that of bounding 
the sensitivity of such functions. This question was first considered in detail in [6j where it was 
conjectured that: 

Conjecture 1 (Gotsman-Linial Conjecture). Let f be a degree-d polynomial threshold function in 
n > 1 variables, then it's average sensitivity (for the definition of average sensitivity see Section 
\2.3\) is bounded by 

d-l 



AS(/)<2-»«|;(^^j_^_;j^^j)(„-L(n-.)/2J). 



It should be noted that if Conjecture [T] holds, then the stated bound would in fact be tight 
for / defined by the product of the linear polynomials that cut through the middle d layers of the 
hypercube. It is also of interest to note the asymptotics of the bound given in Conjecture [H In 
particular, for n ^ the upper bound given is @{d^/n). Furthermore, by results in [7] and [9], 
Conjecture [1] would also imply asymptotically tight bounds for several other measures of sensitivity. 

In this work, we prove a new bound on the average sensitivity of a polynomial threshold function 
and show in particular that for fixed degree that the exponent of n given by Conjectured] is correct. 

Theorem 2. Let f be a degree-d polynomial threshold function in n > 1 variables, then 

A§(/) < V^(log(n))0('='i°gW)20('=''i°sW). 

Again by reductions from [7J and [9j , this would also imply new bounds on the noise sensitivity 
and Gaussian average sensitivity of polynomial threshold functions. Namely, 

Corollary 3. For f a degree-d polynomial threshold function in n > 1 variables, and for 1/2 > 
6 > 0, then 

msif) = V5(log((5-^))^('^^°s('^))2°('='''°*?('^)), 

and 

,§(/) = V^(log(n))«('^i°sW)20(rf^iogW). 
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1.1 Previous Work 



Proving the conjectured bounds for the various notions of sensitivity has proved to be quite difficult. 
The degree-1 case of Conjecture [T] was known to Gotsman and Linial. The first non-trivial bounds 
for higher degrees were obtained independently by [7] and [3] , who later combined their papers into 
[3]. They essentially proved bounds on average sensitivities of Od{n^~^^^^'^^) and bounds on noise 
sensitivities of Od{S^^^^'^^)- For the special case of Gaussian noise sensitivity, the author proved 
essentially optimal bounds in [lOj of 0{d\^). More recently, in [9], the author managed to use this 
result to get an improved estimate for the Bernoulli case giving a bound on average sensitivity of 
Oc,rf(n^/^"'"'^) for any c > 0, for the first time obtaining an exponent of n bounded away from 1 even 
as d goes to infinity. In this work, we improve this bound further, yielding the correct exponent. 

1.2 Overview of our Technique 

We begin with a very high level overview of our technique. A somewhat more detailed overview can 
be found below in Section 13.11 Very roughly, our bound is obtained via a recursive bound in terms 
of n. We begin by splitting our coordinates into b roughly equally sized blocks (for b = n^/®^'^)). 
The average sensitivity is then the sum over blocks of the expected average sensitivity of a random 
restriction of the function to a block. Our bound will follow from the claim that on average all but 
0{Vb) of these blocks correspond to polynomials with standard deviations much smaller then their 
means, and thus have constant sign with high probability. This result is obtained by considering the 
relative sizes of p and its derivative at random points. Using the idea of strong anticoncentration 
from [8] (see Lemma [9] below), we know that on Gaussian inputs that p is likely not much smaller 
than its derivative. We bring this result into the Bernoulli setting by way of an invariance principle 
and regularity lemma, completing the proof. 

This paper is organized as follows. In Section [2l we provide some notation and basic results. 
In Section [3l we provide a more detailed version of the above, providing a sketch of a proof of the 
weaker bound AS(/) < -y/nexp (0((i log log(n))^) . We then discuss the modifications necessary to 
obtain our stronger bound, and introduce some additional tools. Finally in Section |H we prove 
Theorem [2j 

2 Background and Notation 
2.1 Notation 

Throughout we will use X, Y, Z to represent standard multidimensional Gaussian random variables 
and A,B,C to represented standard multidimensional Bernoulli variables unless otherwise specified. 
For a function / : M" — )• R, and a vector v G M", we let Dyf{x) be the directional derivative of / 
at X in the direction of f , or equivalently, Dyf{x) = v ■ V/(x). For completeness, we formally state 
the definition of a polynomial threshold function: 

Definition. A function f : M" — )■ M is a (degree-d) polynomial threshold function if it is of the 
form 

f{x) = sgn{p{x)) 
for some (degree-d) polynomial p : M" — )• M. 
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2.2 Polynomials with Random Inputs 

Here we review some of the basic distributional results about polynomials evaluated at random 
Gaussian or Bernoulli inputs. To begin with we define the standard L* norms: 

Definition. // / : M" — )■ M is a function and t > 1 is a real number we let 

\fU={E[\f{X)f]f\ \f\B,t = {E[\f{A)\']Y^\ 

Recall that above X is a standard n- dimensional Gaussian and A a standard n-dimensional Bernoulli 
random variable. 

The following Lemma relating the norms will prove to be important: 
Lemma 4. If p is a multilinear polynomial then 

\P\2 = \p\b,2 

Proof. This follows immediately upon noting that the polynomials of the form Jlies ^« subsets 
S C {1, . . . ,n} form an orthonormal basis for the set of multilinear polynomials with respect to 
both the inner product defined by the Gaussian measure and the inner product defined by the 
Bernoulli measure. □ 

One of the most important results on the distribution of the values of polynomials is the hyper- 
contractivity result which relates the values of higher moments to the second moment. In particular, 
the following follows from results of [l] and [13] : 

Lemma 5. Let p be a polynomial of degree-d and t >2 a real number. Then 

\p\t < Vt - l'^\p\2, \p\B,t < Vt - 1'^\p\b,2- 

These bounds on higher moments allow us to prove concentration bounds on the distribution 
of our polynomial. In particular, we have the following corollary (see [9]): 

Corollary 6. For p : M" — t- M a degree-d polynomial N > 0, then 

Pr{\p{X)\ > N\p\2) = O (2-W2f'') , Pr{\p{A)\ > N\p\b,2) = O (2-^2^'') . 

In addition to this concentration result, we will also need some anticoncentration results (i.e. 
results that tell us that the value of p does not lie in a small interval with too large a probability) . 
For starters, applying the Paley-Zygmund inequality (see |14] ) to p'^, we obtain the following result, 
which we call "weak anticoncentration": 

Corollary 7 (Weak Anticoncentration). Let p be a degree-d polynomial in n variables. Then 

Pr{\p{X)\ > \p\2/2) > 9-^2, Pr{\piA)\ > \p\b,2/2) > 9"'/2. 

While the bounds in Corollary [7] are fairly weak, not much more can be said in the Bernoulli 
case. In particular, it is not hard to demonstrate non-zero, degree-d polynomials p so that p{A) = 
with probability 1 — 2~'^. On the other hand, in the Gaussian case it can be shown that the output 
of p is bounded away from zero with large probability. In particular, we have the following result 
of Carbery and Wright (|2]): 
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Lemma 8 (Carbery and Wright). If p is a degree-d polynomial and e > then 

Pr{\p{X)\<e\p\2) = 0{de'/''). 

Perhaps more importantly though for our purposes the idea of strong anticoncentration, intro- 
duced in [8], which relates the size of a polynomial to its derivative. In particular we will need: 

Lemma 9 (Strong Anticoncentration). Let p be a non-zero degree-d polynomial and e > 0, then 

Pr{\p{X)\<e\DYp{X)\)=Oid\). 

Proof. For real number 6 let 

Xg = cos{e)X + sm{e)Y, 
Yg = - sm{9)X + cos{e)Y. 

We note for any 9 that Xg and Yg are independent standard Gaussians. Taking 6 to be uniformly 
distributed over [0,27r], we have that 

Ft{\p{X)\ < e\DYp{X)\) = FT{\p{Xg)\ < e\DY,p{Xg)\) 

d 



Pr \p{Xg)\ < e 



dd 



(piXe)) 



Ex. 



Y 



Pie \p{Xg)\ < e 



d_ 
'do 



(piXe)) 



We claim that for any X, Y that do not leave p{Xg) identically that the inner probability is 
0{d^€). We may write p{Xg) as a degree-d polynomial in sin(0) and cos{9). Thus we may write 
p{Xg) as e~*'^^g(e*^) for some polynomial q of degree at most 2d. Letting z = e*^ we have that 



%ip{Xg))\ 

\p{Xe)\ 



-dz-'^ + 



q{z) 



<d + 



j'{z) 



q{z) 



Now if e > l/(2ci), we have nothing to prove. Otherwise, it suffices to bound the probability 
that the logarithmic derivative of g at z has absolute value at most l/(2e). We may factor q as 
q{z) = cY\^-^^{z — ri) where g <2d and c, rj are some complex numbers. We have that 



q'{z) 




^ 1 


q{z) 




E 

^-^ z — ri 

i=l 



< 



2d 



mm,- \z 



Hence we have that \p{Xe)\ < e\^{p{Xg))\ only if | z — r,| < Ade for some i. By the union bound 
over i, this happens with probability at most 2dO{4:de) = 0{d'^e). This completes our proof. 

Remark. A tighter analysis will actually achieve a bound of 0{d\og{d)e), which is optimal. 



□ 



Finally, we will need a single result on the average size of the derivative of a polynomial. In 
particular the following follows from results in [9]: 

Lemma 10. For p a degree-d polynomial, then 

Var{p{X)) < E[\DYp{X)f] = E[\Vp{X)f] < dVar{p{X)). 
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2.3 Sensitivity and Influence 

We now define the i^^ influence of a function on the hypercube. 

Definition. /// : {—1, 1}" — )• M and i is an integer between 1 and n, we define 

InMf) = EA[VarA,{fiA))]. 

This is the average over ways of picking the values of all coordinates except for the i*^ of the variance 
over the i*^ coordinate of f. Alternatively it is 

h.[\f{A)-f{A^r] 

where A^ is obtained from A by negating the i*'^ coordinate. Finally, if f is given as a multilinear 
polynomial on M" it is not hard to show that 



df 2 



dxi 



2 

The last definition may be combined with Lemma [TU] to obtain the following Corollary: 
Corollary 11. If p is a multilinear, degree-d polynomial in n variables, then 



Var{p{A)) <^Infi{p) < dVar{p{A)). 



1=1 

An important notion is that of regularity of a polynomial, which is a measure of how much 
influence any one coordinate can have on the output. We recall: 

Definition. We say that a polynomial p is r-regular for some t > if 

Infiip) < TVar{p{A)) 

for all i. 

We also recall the definition of the average sensitivity (also known as the total influence) of a 
Boolean function. 

Definition. /// : {-1,1}" ^ {-1,1} then 

n 

AS(/) ■.= Y,Infi{f)- 

i=l 

Finally, we define some functions to keep track of the maximum possible average sensitivity of 
a polynomial threshold function of a given dimension, degree, and amount of regularity. 

Definition. If d,n,T > are real numbers we let MAS{d, n) be the maximum over polynomial 
threshold functions f of degree at most d and dimension at most n o/ AS(/). We let MRAS{d, n, r) 
be the maximum over such functions f where additionally f{x) = sgn{p{x)) for p a degree-d, r- 
regular polynomial of AE{f ). 
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2.4 Invar iance and Regularity 

An important tool for us will be the invariance principle of [12], which relates the distribution of a 
polynomial under Gaussian input to its distribution under Bernoulli input. In particular, we have: 

Theorem 12 (The Invariance Principle (Mossel, O'Donnell, and Oleszkiewicz)). Ifp is a r-regular, 
degree-d multilinear polynomial, and t G M, then 

\Pr{p{X) <t)- Pr{p{A) <t)\= 0(dTi/(^'^)). 

We will need a theorem similar to Theorem [T2j The following is proved by nearly identical 
means to Theorem 1121 

Proposition 13. Let p and q be degree-d, multilinear polynomials inn variables. Suppose for some 
r > that Inf{p), Inf{q) < r for all i. Suppose furthermore that \p + q\2, \p — q\2 > 1- Then 

Pr{\piA)\ < \q{A)\) = Pri\p{X)\ < |g(X)|) + 0(dri/(8«^)). 

Proof. We note that it suffices to prove only that 

Pr(|p(^)| < |g(^)|) < Pr(|p(X)| < |g(X)|) + 0(dri/(8'^)) 

and to note that the other direction follows from interchanging p and q. 
Note that 

Pr(|p(^)| < \q{A)\) = Pr{\p{A)\ < q{A)) + Pr(|p(^)| < 

and 

Pt{\p{X)\ < \q{X)\) = Fr{\p{X)\ < q{X)) + Pr(|p(X)| < -q{X)), 
it suffices to show that 

Pr(|p(^)| < q{A)) < Pr(|p(X)| < + 0{dT^/^^''^). 

Letting r = q — p and s = q + p, we need to show that 

Pr(r(A) > and s{A) > 0) < Pr(r(X) > and s{X) > 0) + 0((ir^/(^'^)), (1) 

where r and s are polynomials of degree-d, norm at least 1, and maximum influence at most r. 
By rescaling r and s, we may assume that \r\2 = \s\2 = 1- 

Let yO be a smooth function so that p{x) = 1 for x > 0, p{x) = for x < —t^^^, and < p{x) < 1 
for all X. We note that such p can be found with = 0{t~^^^) for all x and all 1 < A; < 3. 

Define 

g{x) := ip{r{x), s{x)) := p{r{x))p{s{x)). 
Since g{x) = 1 whenever r and s are both positive, 

Pr(r(^) > and s{A) > 0) < E[g{A)]. 

We claim that 

\E[g{A)]-E[g{X)]\ < 2^('^V^/^ 

This follows immediately from Theorem 4.1 of |11) . noting that B = 0{t~^^^). Notice that if 
r > d^^ that we have nothing to prove and that otherwise 2'^('^V^/^ = 0{dT^^^^'^^). 
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We now need to bound the expectation of g{X). We note that g{X) is unless r{X), s{X) > 
_^i/8^ This can happen only if either both are positive or at least one has absolute value at most 
r^/s. Thus 

E[g{X)] < Fi{r{X) > and s{X) > 0) + Pr(|r(X)| < r^/*) + Fr{\s{X)\ < r^/^). 
By Lemma [HI this is at most 

Pr(r(X) > and s{X) > 0) + 0{dT^^^^'^^). 

Thus, 

Pr(r(^) > and s{A) > 0) < E[g{A)] 

< Pr(r(X) > and s{X) > 0) + 0((ir^/(^<^)). 

This proves Equation ([1]), and completes our proof. □ 

The invariance principle will turn out to be very useful to apply to regular polynomials, but for 
general polynomials we will need a way to reduce to this case. For this purpose we can make use 
of the following result of [5]: 

Theorem 14 (Diakonikolas, Servedio, Tan, Wan). Let f{x) = sign{p{x)) he any degree-d PTF. 
Fix any r > 0. Then f is equivalent to a decision tree T , of depth 

depth{d,T) = - ■ ((ilog(r-i))^('^) 
r 

with variables at the internal nodes and a degree-d PTF fp = sgn{pp) at each leaf p, with the 
following property: with probability at least 1 — t, a random path from the root reaches a leaf p such 
that fp is T-close to some r-regular degree-d PTF. 

Unfortunately, for our purposes, we will also require a stronger version of this Theorem. 

Proposition 15. Let p be a degree-d polynomial on the hypercuhe and let 1/4 > r, e,5 > he real 
numbers. Then p can he written as a decision tree of depth at most 

D = {dlog{T-')log{e-')f^'^ log(5-i) 

with variables at the internal nodes and a degree-d polynomial threshold function fp = sgn{pp) at 
each leaf p, with the following property: that for a random leaf, p, with probability 1 — 5 we have 
that Pp is either r-regular, or constant sign with probability at least 1 — e. 

Proposition [15] will follow from repeated application of the following Lemma. 

Lemma 16. Letp be a degree-d polynomial on the hypercube and /et 1/4 > r, e > 6e real numbers. 
There exists a set S of coordinates with 

\S\<T-^ (dlog(T-i)log(e-i))^(") 

so that after assigning random values to the coordinates of S, with probability at least 2~^^'^^ over 
the choice of assignments, the restricted polynomial Pp is either r-regular or has constant sign with 
probability at least 1 — e. 
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Proof. We assume without loss of generality that p is multilinear with \p\2 = 1. We take S to 
simply be the set of all coordinates of influence more than 



T((ilog(T-i)log(e"i))^ 



Aid 



for M a sufficiently large constant. We have that IS"! will be of the appropriate order since the 
total influence of p is at most d. We claim that with probability at least 2~^^'^^ that both of the 
following hold: 

\Pp\l > 1/2. (2) 

max(Infi(pp)) < T{A\og{e-^))-''/^ (3) 
i 

As for Equation ([2]), we note that \pp\2 is a polynomial of degree at most 2d in the assignments of 
the coordinates in S. Furthermore its expectation is Therefore, the norm of this polynomial 
is at least \p\2 = 1, and hence by Corollary [71 Equation ([2]) holds with probability 2~'-^^'^\ We 
now need to show that Equation ([3]) fails to hold with at most half of this probability. We note 
that for each i that Infj(pp) is the sum of squares of degree-d polynomials in the assignments of 
coordinates of S, and has mean value Infj(p). Thus it is given by some degree-2(i polynomial, q with 
|g|i = Infi(p). By Corollary El \qi\i > 2-^W|5i|2/2, and thus \qi\2 = 2<^('^)lnfi(p). Now, for each 
i ^ S, Infj(p) < r((ilog(r~^) log(e^^))^^'^'^ := m. By Corollary [6l we have that for M sufficiently 
large 



Pr (inUp,) > r(41og(e-i))-'^/2) < m2-^"exp l-d (j^) 



Since there are at most d2'^m ^ coordinates i for which Infj(p) G [m2 ^,m2 ^^'^], the probability 
that any coordinate of pp has too large an influence is at most 



00 00 

^d2'=m-V2-^"exp(-d2('=-i)/'^) < d2-*"2«W ^ 2^^^ exp (-d2^) < 2«Wd2- 

k=l i=0 



■Md 



which is sufficiently small. 

Now if Equations [2] and [3] both hold, then either Var(pp) > (41og(e~^))^'^/^, in which case pp is 
r-regular, or Var(pp) < (41og(e~-'^))~'^/^. In the latter case, since 1/2 < \pp\2 = Var(pp) +E[pp]^, we 
have that letting fi = E[pp] that |^| > 1/2. Furthermore, \pp — ^{2 = Var(pp) < (41og(e~^))~'^/^. 
Therefore, by Corollary [6l we have with probability at least 1 — e that 

\Pp{A) - /i| < |^|. 

And thus with probability at least 1 — e, pp has the same sign as fi. This completes our proof. □ 

Proposition [T5l now follows from applying the construction in Lemma [16] repeatedly to the leaves 
that do not yet satisfy one of the necessary conditions up to a total of at most 2^^'^^ log(5~^) times. 

3 Overview of our Technique 
3.1 Proof of a Simpler Bound 

We begin by providing a somewhat detailed sketch of a proof of the slightly weaker bound that 

MAS(d,n) < \/^exp (0(dlog log(n))2) . 
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Starting with a polynomial threshold function / = sgn(p(x)) for p a degree-d multilinear polynomial 
threshold function in n variables, we begin by using Theorem [TH to reduce to the case where 
p is n~^/^-regular, introducing an error of \/^0((ilog(n))<^W in the process. We then split the 
coordinates into b blocks of roughly equal size for b = n^/^^'^^ , and note that the sensitivity of / is 
the sum over blocks of the sensitivity of / randomly restricted to a function on only that block of 
coordinates. We note that by Corollary [6] that if any of these restrictions have an expected value 
that exceeds their standard deviation by a factor of more than about log(n)'^/'^, then the polynomial 
will have constant sign with high probability and can thus be ignored. We call a block for which 
this does not happen good. 

We thus have that the average sensitivity of / is bounded by the expected number of good 
blocks times MAS{d,n/b). It is not hard to show that a polynomial q with standard deviation at 
least log(n)~'^/^ times the absolute value of its expectation, has a reasonable probability of having 

^>2-^(^)log(n)-/^ 

This allows one to bound the expected number of good blocks in terms of the expectation of 

™ [-mr) ) ■ 

Or more tractably, in terms of the expectation of 

On the other hand, we can use Lemma [9] and Proposition [13] to show that 

for each k. This lets us bound the expected number of good blocks by 0(log(n))'^\/6. This provides 
us with a recursive bound for the average sensitivity, which comes out to roughly 

MAS(d,n) < 0(log(n))V/(i'^'^)MAS((i,ni-i/(S'^)), 

which gives the bound required. 

Unfortunately, in the above argument, the requirement that we only consider whether or not a 
block is good has cost us a factor of log(n)'^ at each recursive step, yielding a bound off by a factor 
of exp((i^ log log(n)^). By being less strict with our reductions, we can instead lose only a poly(d) 
factor at each step, yielding a bound with only polylogarithmic error. In order to do this, instead 
of simply considering whether or not a block is good, we consider more detailed information about 
the ratio of its value and its derivative at a random point. To do this we will need to introduce 
some new machinery, which we do in the next Section. 

3.2 The a Function 

The following will prove to be a key concept for our analysis: 
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Definition. For p a non-zero polynomial we let a{p) be defined by 



Similarly, let 



a{p) := E 



m := E 



min 1 



mill I 1 



We will bound the noise sensitivity of a polynomial threshold function in terms of a{p). First 
we introduce some notation: 

Definition. Let MASa{d, n, a) be the maximum average sensitivity of a polynomial threshold func- 
tion f{x) = sgn{p{x)) where p is a polynomial of degree at most d in at most n variables with 
a{p) < a. 

Let MRASa{d, n, a, r) be the maximum average sensitivity of a polynomial threshold function 
f{x) = sgn{p{x)) where p is a r-regular polynomial of degree at most d in at most n variables with 
oi{p) < Cl- 
in particular, we will prove: 
Proposition 17. 

MASa{d,n,a) < aV^(log(n))0('^i°sW)20('='' i°sW). 
Theorem [2] will follow as an immediate Corollary of Proposition 1171 

We will require a version of Lemma [9] that takes /3(p) into account. In particular, we use the 
following: 

Lemma 18. Let p be a degree-d polynomial in any number of variables and 1 > e > a real 
number. Then 

Pr{\p{X)\ < e\DYp{X)\) = 0{d'f3{p)e)- 

Proof. If e > d~^, the result follows from the fact that 

Pr(|p(X)| < e\DYp{X)\) < Pr(|p(X)| < \Dyp{X)\) < (3{p). 

Thus we may assume that e < d~^. 

For random X,Y , let g{9) = p(cos{9)X + s'm{9)Y). By the proof of Lemma El we have that the 
probability in question is 



FTi\gi9)\ < e\g'{9)\) < 0{dh)Prx,Y (^^9 : 



\9'{0)\ ^ ^-1 



m\ 



We note that g{9) = X^m=-d some constants am- If it is the case that |ao| > 2 ^^.^q 

then \g{9)\ > \ao\/2 for all 9, and 



\9'm 



< d|ao|/2. 



Thus, in this case, \g' {9)\/\g{9)\ < d < e ^ for all 9. Thus the probability in question is at most 



Oidh)Prx,Y \ao\ < 2 ^ 
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When |ao| <2J2m^o 



Oml, we have \g{0)\ < ^ \am\ for all 9, and the average value of \g'{9)\'^ is 




Thus, \g' {9)\/\g{9)\ > l/{8d) with constant probability. Hence we have that 



Pr(|p(X)| < e\DYp{X)\) = 0{dh)FT{\p{X)\ < 8d\DYp{X)\) 



= 0{dh)Fr{\g{9)\<8d\g'{9)\) 

= 0{d^e)Pr{\p{X)\<8d\DYp{X)\) 

= Oid^/3{p)e). 



□ 



4 Proof of the Main Theorem 

In this Section, we prove Proposition [T7] and thus Theorem [2l We begin in Section 14.11 by proving 
a recursive bound on average sensitivity for regular polynomial threshold functions. In Section [4. 2^ 
we show a reduction to the regular case. Finally, in Section [4.31 we combine these recursive bounds 
to obtain a proof of Proposition [T71 

4.1 The Regular Case 

Here we prove the reduction in the case of a regular polynomial. In particular, we show: 
Proposition 19. Let d,n,T,a > be real numbers and let b < n be a positive integer. Then 



Proof. Consider / = sgn(p(x)) for p a r-regular, degree-d, multilinear polynomial in at most n 
dimensions with Var(p(^)) = 1 and a{p) < a. It suffices to show that for all such / that 



for an appropriate bJ;. 

We begin by partitioning the coordinates of / into b blocks each of size at most n/b + 1. For 
each block, and Bernoulli random variable. A, we let be the coordinates of A that do not lie 
in I. We let be the function defined on the coordinates of I obtained by plugging these values 
into p for the other coordinates. We define /^c similarly. It is not hard to see that 



MRASa{d,n,a,T) < bEi^[MASa{d,n/b + 



for some non-negative random variable H with E[H] = 0{d^ab ^^'^ + d^T^I'^^'^^^. 



AS{f) < 6EH[MASa(d, n/b + 1, H)] 



AS(/) = ^E^4AS(/^,)]- 



It thus suffices to show that 




(4) 



i 
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We have that 



e e 



2\ n 



mill 1 



< O I E 



mill ( b, 



mill b 



\DBPMiA)\ 

\p{A)\^ 

\piA)\^ 

( \Dbp{A)\ \' 
'V \P{A)\ J 



We note that 



E 



mill b. 



( \Dbp{A)\ V 
V \P{A)\ J 



mill ^1, 

■b 



( \Dbp{A)\ \- 
V \P{A)\ J 



+ ^ PT(\p{A)\<t-'/^\DBp{A)\)dt 
a{p) + j'pr {\p{A)\ < t-'/^\DBp{A)\) dt. 



To bound the second term above, we use an invariance principle to relate the necessary proba- 
bilities to those in the Gaussian case. In order to do so we define the polynomial q{A, B) = Dbp{A). 
To show that q has small influences we note that 
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= (ilnfi(p) 
< dr. 

Where the middle line above is by Lemma [TOl Thus all of the influences of p and q are at most dr. 
Furthermore, it is easy to see that p and q have covariance (since p is even in B and q is odd in 
terms of B). Thus we have for any real s that 

\p + sq\2 > \p\2 > YaT{p{A)) = 1. 

Therefore by Proposition \T3\ for any real s we have that 

Ft{\p{A)\ < s\Dbp{A)\) = Pr(|p(X)| < s\Dyp{X)\) + Oidr^'^'^^). (5) 



12 



Applying Equation ([5]), we find that 



m = [ Pr(|p(X)| < s~'/^\DYpiX)\)ds 
Jo 

= [ PT{\p{X)\<s-^/^\DYp{X)\)ds + 0{dT'^/^^'^^) 
Jo 

By Equation ([5]) and Lemma [HI we find that 
j''Fr[\p{A)\ <t-^/'^\DBp{A)\^dt = y"*Pr(|p(X)| < t-^/^\DYp{X)\^ dt + 0{dbT^/^^'^'^) 

= / 0{dh-^/^l3{p))dh + 0{dbT^/^^'^^) 

= 0{d^Vbp{p)) + 0(d6ri/(S'^)) 
= 0{d^Vha{p)+d%T^/^^'^^). 

This completes the proof of Equation as desired. □ 
4.2 Reducing to the Regular Case 

In this Section, we show by a simple application of Proposition [15] that the average sensitivity of 
an arbitrary polynomial threshold function can be bounded in terms of the sensitivity of a regular 
one. In particular, we show that: 

Proposition 20. For any d, n, a, r, e > we have that 

MASa{d,n,a) < T-^{dlog{T-^)log{e-^)f^'^^+3ne + En[MRASaid,n,i<,T)], 
for some non-negative random variable H with E[H] = a. 

Proof. Let p be a degree-d polynomial in n variables with a{p) < a. Let / = sgn op. We will show 
that for an appropriately chosen H that 

AS(/) < r-^(dlog(r-i)log(e~^))°(^) +3ne + EH[MRASa(d,n,^^,r)]. 

We begin by writing / as a decision tree as given to us in Proposition [15] with 6 set to e. We 
claim that the average sensitivity of / is at most the depth of the decision tree plus the expectation 
over leaves of the tree of the average sensitivity of the resulting function. To show this we note 
that the average sensitivity of / is equal to the expected number of coordinates, i so that f{A) 
disagrees with /(^*), where A^ is obtained from A by flipping the i^^ coordinate. We compute this 
probability by first conditioning on the path through the decision tree defined by A. Except for a 
number of coordinates that is at most the depth of the tree, flipping the i*^ coordinate leaves us in 
the same leaf. The expected number of such coordinates that we can flip to change the sign of / is 
at most the average sensitivity of the function corresponding to that leaf. The expected number of 
other coordinates is at most the depth of the decision tree. This completes the proof of this claim. 

Thus we have 

AS(/) < r-i(a!log(r-i)log(e-i))^('^) +Eieavesp[AS(/p)]. 
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With probabihty 1 — e, fp is either r-regular or constant sign with probabihty 1 — e. The contribution 
from the remaining e probabihty set of leaves is at most ne, and the contribution from the leaves 
with nearly constant sign is at most 2ne. We thus need to bound the contribution from the leaves for 
which fp is r-regular. This is an expectation of the average sensitivities of the threshold functions 
of r-regular, degree-d polynomials in at most n variables. We have only to show that 

nc^iPp)] < a. 

But this follows immediately from the definition of a. □ 
4.3 Putting it Together 

Here we combine Propositions [19] and [20] to prove Proposition [T71 
First we need a Lemma: 

Lemma 21. Let p be a degree-d multilinear polynomial in n variables and let f{x) = sgn{p{x)). 
There is a constant K, so that if a{p) < {K\og{n))^'^ then 

A§(/) = 0(a). 

Proof. We note that A§(/) is at most 0{n) times the probability that / takes on its less common 
value. We note that by Corollary [7] that with probability at least 2~^^'^'> that 

> E [\Dbp{A)\''] /4 > Var(p)/4. 

Therefore, by the Markov inequality, there is a probability of at least 2^*-^^'^) that this occurs and 
that additionally 

<2«W|p|2. 

Hence it is the case that 

^ < ^^'^'^aip). 

mi 

Let /i = E[p(^)]. We have that 

\p - /i|2 = v'Var(p) < 2°^'^\p\2^/^. 

Since \p\'^ = fjp + Var(p), we also have that 

\p-ti\2 = \/Var(p) < 20W|/i|7^. 

Hence for a{p) < {Klog{n))~'^ for K sufficiently small, we have by Corollary [6] p( A) has the same 
sign as fi with probability 1 — 0{a{p)n~^), yielding our desired bound. □ 

Proof of Proposition \17\ Let r = n~^/^, e = n~^, and n > 2^^^^^ ^"sC'^) for M a sufficiently large 
constant. By Proposition [20] we have that 

MASa((i, n, a) < 0{n^^^) + EN[MRASa(d, n, n'^/^)], 

for some ^^ with E[^!;] = 0(a). Let b = ^n^/^^^^^~\. Applying Proposition [19] to the above, we have 
that 

MASa((i, n, a) < 0{n^/^) + 6EK[MASa(d, n/b + 1, H)] 

< 0(ni/2) + 2ni/(^6'^)EN[MASa(d, 71^-1/(^6'^), b^)]. (6) 



14 



Where above E[H] = ©(d^a^-^/^ _^ (^4^-i/(24d)^_ Notice that either a > log(n))-'^, in which case, 
or a < (i^log(n)) in which case MASa((i, n,a) < v}^^ by Lemma [211 Thus 
in any case, Equation ([6]) holds for some N with E[H] = 0{d?ab~^^'^). 

We now proceed by induction on n. In particular, for a sufficiently large constant M, we prove 
by induction on n that 

MASa(d, n, a) < a^/^(log(n))*^'^i°s(^)2*"' (7) 

We begin by showing this for n < 2*^^^^ For such n, the bound follows from Lemma [2T] and 

the trivial bound of n. 

Next suppose that Equation ([7]) holds for all smaller values of n. Bounding the MASa(d, n,H) 
terms in Equation ([6]) recursively, we obtain 

MASa(d,n,a) < ©(n^/^) + 0(d3)ni/(i6rf)E[^]„i/2-i/(32d) (^^g^^^^^ _ ^/(-^g^))Mdiog(d)2Md2 iog{d) 

= 0(ni/2) + aO(d3)^(log(n))^'"i°s('^)d-^W2*^^''°s('^) 
= 0(ni/2) + aV^(log(n))*"'°s(^)0(d=^-^(*^))2*"'i°gW 

Where the last line holds when M is sufficiently large. Now, if a > (i^ log(n))~'^, then this is at 
most a-y/n(log(n))*^'^'°s(°'^2*^°'^^°sW^ as desired. If on the other hand, a < {K log{n))^'^ , the same 
bound follows instead from Lemma [211 In either case we have 

MASa(d, n,a) < a^/^log(n)^"i°e('^)2*^'^'^°gW. 

This completes our inductive step and finishes the proof. □ 

5 Concluding Remarks 

We believe that using techniques from [9j, that the bound on average sensitivity can be improved 
to 

V^Od(log(n))0(i°s(<i)). 

The basic idea would be to use the diffuse regularity lemma and invariance principle instead of the 
standard ones in the proof above. This allows us to take a number of blocks, b polynomial in n 
rather than n^/®^'^\ This decreases the number of rounds in our recursion by a factor of d, and thus 
lowers the asymptotic exponent by a corresponding factor. Unfortunately, the poor dependence on 
degree in the technology from [9], means that this bound will have perhaps a very bad dependence 
on d. 

Although it seems that for fixed d we have obtained nearly the correct asymptotic in terms 
of n, our dependence on d is still fairly bad. In particular, the bound given in Theorem [2] does 
not improve upon the trivial bound of n until log(n) ^ d'^log{d). The reason for this is that in 
our inductive step, we wish to replace ab~^^'^ + dr^^^^'^^ by 0(a6^^/^), so long as a > (i^ log(n))^'^ 
(since otherwise we can use simpler bounds). On the other hand, using the easily established 
bound MASa((i, n, a) = 0(na), it is not hard to prove the bound AS(/) < 0{d^)n^-^/^^^'^\ which 
is non-trivial for n = 2<^('^i°g('^)). 
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